Modelling energy flow in the vocal tract with applications to glottal closure and opening detection
Abstract
The pitch-synchronous analysis that is used in several areas of speech processing often requires robust detection of the instants of glottal closure and opening. In this paper we derive expressions for the flow of acoustic energy in the lossless-tube model of the vocal tract and show how linear predictive analysis may be used to estimate the waveform of acoustic input power at the glottis. We demonstrate that this signal may be used to identify the instants of glottal closure and opening during voiced speech and contrast it with the LPC residual signal that previous authors have used for this purpose.
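The LPC residual mentioned above as the baseline signal can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's acoustic input power estimator, whose expressions are not given in the abstract. The function names `lpc_autocorrelation` and `lpc_residual`, the use of the autocorrelation method, and the synthetic two-pole test signal are all assumptions made for the example. The reflection coefficients returned by the recursion are the quantities usually associated with the junctions of a lossless-tube model, up to a sign convention.

```python
import numpy as np


def lpc_autocorrelation(frame, order):
    """Autocorrelation-method LPC solved with the Levinson-Durbin recursion.

    Returns (a, k): a is the prediction-error filter [1, a_1, ..., a_p] and
    k holds the reflection coefficients produced at each recursion step.
    """
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(frame[:n - lag], frame[lag:]) for lag in range(order + 1)])
    if r[0] <= 0.0:
        raise ValueError("frame has no energy")

    a = np.zeros(order + 1)
    a[0] = 1.0
    k = np.zeros(order)
    err = r[0]  # prediction-error energy
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k_i = -acc / err
        prev = a.copy()
        # Order update: a_new[j] = a[j] + k_i * a[i - j] for j = 1..i-1.
        a[1:i] = prev[1:i] + k_i * prev[i - 1:0:-1]
        a[i] = k_i
        k[i - 1] = k_i
        err *= 1.0 - k_i ** 2
    return a, k


def lpc_residual(frame, a):
    """Inverse-filter the frame with the prediction-error filter a."""
    frame = np.asarray(frame, dtype=float)
    return np.convolve(frame, a)[:len(frame)]


# Hypothetical test signal: an impulse train driving a two-pole resonator.
period = 80
excitation = np.zeros(800)
excitation[::period] = 1.0
true_a = np.array([1.0, -2 * 0.9 * np.cos(0.3 * np.pi), 0.81])
speech = np.zeros_like(excitation)
for t in range(len(speech)):
    speech[t] = excitation[t]
    for j in range(1, len(true_a)):
        if t - j >= 0:
            speech[t] -= true_a[j] * speech[t - j]

a, k = lpc_autocorrelation(speech, order=2)
residual = lpc_residual(speech, a)
# Samples where the residual is large mark the excitation instants.
peaks = np.flatnonzero(np.abs(residual) > 0.5 * np.abs(residual).max())
```

On this idealised signal the residual is large only at the excitation instants. Real voiced speech gives a much less clean residual, which is why the abstract contrasts it with the proposed power signal.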
Related resources
Steady Flow Through Modeled Glottal Constriction
The airflow in the modeled glottal constriction was simulated by the solutions of the Navier-Stokes equations for laminar flow, and the corresponding Reynolds equations for turbulent flow in generalized, nonorthogonal coordinates using a numerical method. A two-dimensional model of laryngeal flow is considered and aerodynamic properties are calculated for both laminar and turbulent steady flows...
Glottal source processing: From analysis to applications
The great majority of current voice technology applications rely on acoustic features, such as the widely used MFCC or LP parameters, which characterize the vocal tract response. Nonetheless, the major source of excitation, namely the glottal flow, is expected to convey useful complementary information. The glottal flow is the airflow passing through the vocal folds at the glottis. Unfortunatel...
Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditions
For voiced speech the main excitation of the vocal tract occurs at the end of the glottal closing phase when the rate of change of the flow reaches its absolute maximum. This study presents a straightforward method that yields a numerical value to characterize the effect of the main excitation on vocal intensity. The method, Energy Ratio by Modified Excitation (ERME), takes advantage of the glo...
Regulation of glottal closure and airflow in a three-dimensional phonation model: implications for vocal intensity control.
Maintaining a small glottal opening across a large range of voice conditions is critical to normal voice production. This study investigated the effectiveness of vocal fold approximation and stiffening in regulating glottal opening and airflow during phonation, using a three-dimensional numerical model of phonation. The results showed that with increasing subglottal pressure the vocal folds wer...
Modelling of Human Glottis in VLSI for Low Power Architectures
The Glottal Source is an important component of voice as it can be considered as the excitation signal to the voice apparatus. Nowadays, new techniques of speech processing such as speech recognition and speech synthesis use the glottal closure and opening instants. Current models of the glottal waves derive their shape from approximate information rather than from exactly measured data. Genera...
Publication date: 1999